Interfaces to Support the Scholarly Exploration of Text Collections
نویسندگان
چکیده
The analysis of text collections forms the basis of scholarship in many disciplines in the humanities and social sciences. Despite the growing availability of electronic texts, automated techniques have not been effectively exploited to support the activities of scholars in these fields. We present a prototype search interface for exploring text collections that places equal emphasis on content, what the document is about, and metadata, the context that situates a piece of text. As a start, we focus on a selection of briefs and opinions from the U.S. Supreme Court to support legal scholars.
منابع مشابه
Topical Categorization of Large Collections of Electronic Theses and Dissertations
Electronic Theses and Dissertations (ETDs) form an important part of scholarly work. Many universities in the USA, and other parts of the world, require their students to submit their theses and dissertations in electronic form. The ETDs are hosted by the respective universities, and no single point of access exists to the different ETD collections. Various initiatives like NDLTD have aimed to ...
متن کاملTopicViz: Semantic Navigation of Document Collections
When people explore and manage information, they think in terms of topics and themes. However, the software that supports information exploration sees text at only the surface level. In this paper we show how topic modeling – a technique for identifying latent themes across large collections of documents – can support semantic exploration. We present TopicViz, an interactive environment for inf...
متن کاملParaText: Scalable Text Modeling and Analysis pdfkeywords
Automated analysis of unstructured text documents (e.g., web pages, newswire articles, research publications, business reports) is a key capability for solving important problems in areas including decision making, risk assessment, social network analysis, intelligence analysis, scholarly research and others. However, as data sizes continue to grow in these areas, scalable processing, modeling,...
متن کاملFaceted Browsing of Text Collections
Faceted navigation is a proven technique for exploration and discovery of a resource collection. In this paper, we report on a visual support toward the exploration of a collection of documents based on a set of entities of interest to users, in which faceted navigation is employed for the filtering process. Our approach can be used when metadata is not available and unlike other faceted browsi...
متن کاملExploration of Full-text Databases with Self-organizing Maps
Availability of large full-text document collections in electronic form has created a need for intelligent information retrieval techniques. Especially the expanding World Wide Web presupposes methods for systematic exploration of miscellaneous document collections. In this paper we introduce a new method, the WEBSOM, for this task. Self-Organizing Maps (SOMs) are used to represent documents on...
متن کامل